In-database connected component analysis
نویسندگان
چکیده
We describe a Big Data-practical, SQL-implementable algorithm for efficiently determining connected components for graph data stored in a Massively Parallel Processing (MPP) relational database. The algorithm described is a linear-space, randomised algorithm, always terminating with the correct answer but subject to a stochastic running time, such that for any ǫ > 0 and any input graph G = 〈V,E〉 the algorithm terminates after O(log |V |) SQL queries with probability of at least 1− ǫ, which we show empirically to translate to a quasi-linear runtime in practice. Monash University, Melbourne, Australia. Email: [email protected] Monash University and Telstra Corporation, Melbourne, Australia. Email: [email protected] UBS, Zürich, Switzerland. Email: [email protected]
منابع مشابه
Connected Component Based Word Spotting on Persian Handwritten image documents
Word spotting is to make searchable unindexed image documents by locating word/words in a doc-ument image, given a query word. This problem is challenging, mainly due to the large numberof word classes with very small inter-class and substantial intra-class distances. In this paper, asegmentation-based word spotting method is presented for multi-writer Persian handwritten doc-...
متن کاملReview and comparison of User Interface Characteristics of (Springer, Elsevier, Ebsco, ISI(WOS) and Ovid) as Perceived by University of Tehran Users
Background and Aim: The present investigation intends to compare and review various user interfaces from user standpoint and to ascertain its linkage with user satisfaction. Method: The research incorporated a descriptive survey of University of Tehran graduate student body. Using a targeted sampling, graduate students from the faculties of chemistry and Biology were selected. The instruments u...
متن کاملThe Use of a Selective Database Technique in Order to Recover the Spectra of a Series of Acrylic Paints by the Principle Component Analysis
A procedure for an efficient recovering of reflectance spectra of Acrylic paint samples from CIE tristimulus color values is described. By fixing a certain criteria based on color difference value, the proposed technique preliminarily selects a series of suitable samples from a main dataset containing the reflectance values of a series of different Acrylic paint samples, based on the color ...
متن کاملTowards Text Recognition in Natural Scene Images
In this paper, we propose a novel methodology for text detection in natural scene images. The proposed methodology is based on an efficient binarization and enhancement technique followed by a suitable connected component analysis procedure. Image binarization successfully processes natural scene images having shadows, non-uniform illumination, low contrast and large signaldependent noise. Conn...
متن کاملText Detection in Indoor/Outdoor Scene Images
In this paper, we propose a novel methodology for text detection in indoor/outdoor scene images. The proposed methodology is based on an efficient binarization and enhancement technique followed by a suitable connected component analysis procedure. Image binarization successfully process indoor/ outdoor scene images having shadows, non-uniform illumination, low contrast and large signal-depende...
متن کاملModeling and Availability Analysis of Internet Data Center with various Maintenance Policies
In this paper, the authors have focused on the stochastic analysis of an internet data center (IDC), which consists of a database main server connected to a redundant server. Observing the different possibilities of functioning of the system, analysis has been done to evaluate the various reliability characteristics of the system. The system can completely fail due to failure of redundant serve...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1802.09478 شماره
صفحات -
تاریخ انتشار 2018